IRIT at INEX: Question Answering Task
نویسندگان
چکیده
In this paper we describe an approach to tweet contextualization developed in the context of INEX QA track. The task is to provide a context up to 500 words to a tweet. The summary should be an extract from the Wikipedia. Our approach is based on the index which includes not only lemmas, but also named entities. Sentence retrieval is based on standard TF-IDF measure enriched by named entity recognition, POS weighting and smoothing from local context.
منابع مشابه
Overview of the INEX 2010 Question Answering Track (QA@INEX)
The INEX QA track (QA@INEX) aims to evaluate a complex question-answering task. In such a task, the set of questions is composed of complex questions that can be answered by several sentences or by an aggregation of texts from di erent documents. Question-answering, XML/passage retrieval and automatic summarization are combined in order to get closer to real information needs. Based on the grou...
متن کاملIRIT at INEX 2012: Tweet Contextualization
In this paper, we describe an approach for tweet contextualization developed in the context of the INEX 2012. The task was to provide a context up to 500 words to a tweet from the Wikipedia. As a baseline system, we used TF-IDF cosine similarity measure enriched by smoothing from local context, named entity recognition and part-of-speech weighting presented at INEX 2011. We modified this method...
متن کاملCooperative XML ( CoXML ) Query Answering at INEX 03
The Extensible Markup Language (XML) is becoming the most popular format for information representation and data exchange. Much research has been investigated on providing flexible query facilities while aiming at efficient techniques to extract data from XML documents. However, most of them are focused on only the exact matching of query conditions. In this paper, we describe a cooperative XML...
متن کاملFlesch and Dale-Chall Readability Measures for INEX 2011 Question-Answering Track
For INEX 2011 QA track, we wanted to measure the impact of two generic measures of readability in the selection of sentences related to topics. This is a step towards adaptive information retrieval approaches that take into account the reading skills of users and their level of expertise. We show that Flesch and Dale-Chall measures do not allow to filter sentences for obtaining a satisfactory r...
متن کاملIRIT @ TRECVid HLF 2009 Audio to the Rescue
This notebook paper describes the six runs submitted for the first participation of IRIT at TRECVid 2009 High-Level Feature Extraction task. They were submitted in an attempt to start answering two research questions: 1. Can acoustic information be of any help in this (historically) video-only task? 2. Are Support Vector Machines robust enough to deal with noisy and unbalanced datasets? The six...
متن کامل